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ABSTRACT 


ENpirical Orthogonal) Pumet renmpee.) analysis is used to 
represent the environmental wind forcing of selected western 
North Pacific tropical(G clone itr eacs rot eee The 
EOF analysis is applied separately to the zonal and meri- 
dional wind components at 700, 400 and 250 mb on a 527—point 
grid with 288.7 km zonal and meridional spacing that is 
relocated with the stcerm center. The 527 EOF coefficients 
(for each level and component) are computed for a sample of 
682 cases. The coefficient vectors are truncated to the 
first 35 coefficients based cn a #Honte Carlo selection 
Criterion, These coefficients account for at least 82 
percent of the variance in each field. The EOF cocefrim 
Clients, along with storm movement during the past 24 hours, 
position, date and intensitv, are then used as potential 
predictors ina regression analysis forecast scheme for 
tropical cyclone motion. The ECF-based regression eyuations 
are tested on the dependent data cases. The mean 72-hour 
track forecast error is between 450 and 500 kn. Therefore, 
it appears that this regressicn scheme has potential for 


operational applications. 
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A. BACKGROUND 


Piisestldyeomre laced. to one Of Eho most diftricult prob- 


lems in tropical meteorology--to forecast the movement of 


eEapical cyclones. In discussing the impact of weather on 
naval forces, materials and operations, Wells (1982) empha- 
sized the role of tropical cyclones. Avoidance of tropical 


@aermones 1S important to both military and civilian popula- 
e2LOn S « Fleet operating orders contain lengthy, explicit 
guidance on tropical cyclone evasion. Yet, serious losses 
due to tropical cyclones continue to occur. Because of the 
potential devastation of life and property, continued 
improvement in the ability tc forecast tropical cyclone 
movement 1S imperative. The guidance to avoid storm damage 
is available, but precautions must be taken early. This 
requires accurate tropical cyclone forecast methodology. 
After George and Gray (1976), tropical cyclone movement 
prediction models can be classified into four categories: 
(1) steering fiow; (2) statistical; (3) nmumerical; and 
(4) climatology-persistence. The steering concept treats 
tropical cyclones as vortices embedded in the basic environ- 
mental flow. The statistical forecast approach commonly 
uSeS a screening procedure to select meteoroiogical vari- 
ables that are correlated with tropical cyclone movement. 
These variables are then used to develop regression equa- 
meems LOL prediction. An analcg-statistical model is Lased 
upon the assumption that historical families of repetitive 
storm tracks are associated with repetitive synoptic 
patterns. By scanning historical data records, a computer 


algorithm is used to associaté an existing storm with a 


"parent" storm track o©F with a family Of (Sin@tar stoner 
The numerical method involves predictions of the synoptic 
flow surrounding a cyclone, jand possibly a sSimulationge. 
cyclone structure, to predict storm movement. Prediction of 
tropical cyclone movement cLased on climatology, “and on 
persistence is based upon empirical relationships derived 
from historical records of the tracks of previous cyclones. 
Objective methods for forecasting tropical cyclone movement 
have been developed using one or more of these prediction 
models. As yet, no one of these objective technigues has 
been found to be superior to the others under all conditions 
(e. g., Neumann and Pelissier, 1981). 

The simplest numerical method of predicting tropical 
cyclone movement 1s to use a [Larotropic model on a rela- 
tively coarse grid (Sanders and Burpee, 1968) with a point 
vortex advection scheme (Renard, ToGo. Results obtained 
from these methods demonstrated that there 1s considerable 
information in the analyzed and predicted synoptic fields 
represented on grids which lack the fine resolution neces- 
Sary to resolve the intense wind field near the center of a 
trOpHcatee  Cllome. However, Ley and Elsberry (1976) cited 
these models as inadequate due to the lack of a unique 
steering level (or layer) ang the absence of vortex- 
environmental interaction. Still, the relative success of 
coarse-mesh models supports the idea that it might be 
possible to relate large-scaie forcing (by advective 
processes) of a tropical cyclone to its subsequent movenent. 

Current Statistical models for the prediction of trop- 
ical cyclone movement use predictors derived from clina- 
tology, persistence and either observed or numerically 
Forecast geopotential height datan(such was sgrad fe Wes esr ici 
hesses and time changes). For example, Neumann and 
Randrianarison (1976) developed a purely statistical model 


based on a system of regression equations for the prediction 


of tropical cyclone movement over the Southwest Indian 
Ocean. Basically, the model is CLImatology plus PEFsistence 
(CLIPER), applied to the Indian Ocean. Stepwise regression 
was used to develop second-order polynomials (35 variables) 
to predict the zonal and meridional cyclone displacements. 
The resultant model's performance compared favorably with 
operational models ({Jcint Typhoon Warning Center, 1983). A 
ea@mericant number of North Atlartic tropical cyclones 
exhibit anomalous motion characteristics (Neumann, 1981). 
The forecast tracks of these storms revealed limitations of 
purely statistical forecast systems (Neumann and Lawrence, 
ie 5) - While some researchers sougnt to develop purely 
oymamical models (e. g., Miller et al., 1972), others devel- 
oped Statistical-dynamical mcdels. The current Nie 
statistical-dynamicai model, NHC73, was described by Neumann 
and Lawrence (1975). The results demonstrated that informa- 


tion obtained from numerical prognoses can improve the 


performance of statistical tropical cyclone prediction 
models. 
Statistical models for the predicwon Ger tropical 


cyclone movement have traditionally used a coordinate system 
oriented with respect to the zonal and meridional axes. 
Tropical cyclones tend to move with the synoptic flow. 
Short-term displacements have a very strong persistence 
component. For these reasons, a grid system oriented with 
meopect to the cyclone'’s heading would be a natural choice. 
Shapiro and Neumann (1984) investigated the error-reducing 
potential of a grid system oriented with respect to the 
cyclone heading. This grid-reorientation technigue resuited 
in a 40 percent reduction of the total variance of tropical 
cyclone movement. it was shown, using the depencent data 
sample, that a potential reduction of 24-hour forecast 
errors by approximately 13 percent could be realized for 


Synoptic predictors extracted on a rotated grid. This 


reduction in error waS comparable to the reduction a 
Z4-hour forecast errors during the past 25 years (Shapime 
and Neumann, 1984). Tt was observed that the entire reduc- 
tion of forecast error iS not realizable due to random and 
real errors in the developmental and operational heiyht 
data, respectively. Satisfactcry results were not obtained 
using rotated grids for prediction of 48- and 72-hour trope 
ical cyclone movements. An analysis of forecast results 
revealed that grid rotation cptimized forecasts in the 
direction along which the variance of tropical cyclone nove- 
ment 1S maximized and tended to orient the displacement 
vectors with the along-track direction. The results o£ 
Shapiro and Neumann (1984) indicated the potential forecast 
improvement that can be made in Short-term forecasts with 
current synoptic data if the cyclone's headitg is Known 
However, these concepts must still be tested in an opera- 
tional environment. For this reason, the data grid used in 
this study was geographically oriented. This grid systen 
will be described in the next chapter. 

Both statistical and dynamicai methods have weaknesses 
(Haltiner and Williams, 1980; Shaffer, 1982). Statistwceam 
metnods usually do not forecast well those cyclones that 
have anomalous motions. This problem relates to the "scope" 
of a model, as discussed in Chapter V. Similarly, these 
methods are typically not robust against small changes in 
the synoptic (dynamic) LOLCINgG 4Ol fac yelome: There is a 
general tendency of statistical methods toward homogenized, 
Or smoothed, forecasts. In comparison, dynamical models 
suffer from both theoretical and financial limitations. Due 
to the smallness of the Coriolis parameter ain tropical 
regions, geostrophy cannot be assumed and initialization of 
data fields is difficult. Erroneous data used to inatgaleee 
a model can rapidly deteriorate a wnutlerieal forecasts 


Convective heating is one of the primary driving mechanisms 
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for the maintenance of a tropical cyclone. fee aGirtfeiculty 
of modeling convective heating, tojgetner with initialization 
problems, makes dynamical model predictions suspect in the 
teepics. More importantly, maintenance of the energy 
balance for a tropical cyclone reguires an interaction among 
different scales of motion (Coyama, 193 2) fO avy Ola 
spurious solutions, a small grid mesh is necessary fora 
dynamicai model to numerically simulate these interactions. 
Furthermore, the expense of numerical integration on a fine 
mesh can be yguite large due to the Courant-Fredrichs-Levy 
(CFL) condition, which requires integration to be made with 
smaller time steps as the mesh is decreased (Haltiner and 
fieeertans, 1980). An additional difficulty encountered with 
a fine-mesh model is that observed data in tropical regions 
are inadequate for model initialization. 

Neumann and Pelissier (1981) studied the performance 
characteristics of various tropical cyelone movement 
prediction models in operaticnal use at the National 
Hurricane Center (NHC) in Miami, Florida. These models are 
representative of the current methodology for prediction of 
tropicai cyclone movement. The seven models range in 
complexity from the Lasic analog to the sophisticated numer- 
ical and are identified in Table [I as statistical, 
Statistical~-synoptic, Statistical-dynamical or dynamical. 


Four of the statistical schemes are regression-eguation 


models. Predictors for these eyuations are generally 
derived from climatology, persistence and geopotential 
height data (except CLIPER). A fifth statistical model 
(HURRAN) uses an analog approach. Operational analysis of 


CLIPER and HURRAN (Hope and Neumann, 1970) showed that each 
of these models gives almost identical forecast tracks. 

For torecasting western North Pacific tropical cyclones, 
five main categories of objective technigues are used by the 


Joint Typhoon Warning Center (JTWC), Guam, Marianas Isiands: 
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(1) climatological and analog techniques; (2) extrapolation, 
(3) steering techniques; (4) dynamic models; and (5) empir- 
ical and analytical technigues. A brief description of the 
objective technigues used 1s given in the annual treport 
(Joint Typhoon Warning Cencer, ios 3)- In contrast to the 
NHC, the JTWC has not placed emphasis on the development of 
statistical methods. The variety and range of sophistica- 
tion of techniques in operaticnal use at the NHC and the 
JTWC for objective forecasting of tropical cyclone movement 
1s noteworthy. That simple methods such as mere extrapola- 
tion are competetive with complicated numerical models might 
ke taken aS a Surprising indication that little progress has 
been made aye the improvement on forecast ski lig 
Alternately, the indication could be that tropical cyclones 
are not predictable solely by use of a single class of 
methods. For the years 1972-1983, tne magnitude of the 
track forecast error by the JTWC for western North Pacific 
ELOD TCA cyclones waS approximately 213 kn (115 noe 
497 km (220 n mi) and 667 km (360 n mi) £ Or the 
24-, 48- and 72-hour, respectively (Joint Typhoon Warning 
Cent @r jaan 9 C3 )e. Improvement over these forecast errors is 


seen to be a realistic goal. 


Be “OBJECTIVES 


The main objective of this study is to develop a 
"statistical-climatolcgical" method to forecast tropicam 
cyclone movement. However, ccmputational requirements for 
the development of a regression model from a large synoptic 
grid system limits the humter of possible grid-point 
predictors. An Empirical Orthogonal Function (EOF) approach 
Similar to that used by Shaffer and Elsberry (1982) is 
therefore adopted. If an attempt were made to develop a 


regression model using a large synoptic grid system, the 
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PimoerNOr GCrid—-point predictors would be prohibitive. The 
difriculties inherent in both statistical and dynamical 
methods motivated Shaffer (1982) and Shaffer and Elsberry 
(er S 2) CemmCewe hoped sesStadtistical—climatological tropical 
cyclone track prediction technigue using an EOF representa- 
meron or the synoptic forcing. the EOFS provided an alterna- 
tive to grid-point predictors. The techniygue enabled the 
representation of fields of 120 grid points by 10 eigenvec- 
tors and their associated EOF coefficients. Emgnty-fL£ive 
percent of the total variance of the data was accounted for 
Ey these 10 modes. Shapiro and Neumann (1984) also used 10 


modes to account for 98 percent of the total variance in 


geopotential height data in either a rotated OF 
geographically-oriented grid systen. These advantages of 
data reduction and simple numerical representation of 


synoptic fields make the EOF technigue ideal to use with 
regression analysis. The eigenvectors represented different 
patterns relating to tropical cyclone movement; that as, 
patterns which appeared to be physically important in the 
determination of tropical cyclone movement. This approach 
was novel for forecasting of storm movement in the sense 
that previous regression analysis methods (e. g., Neumann 
and Lawrence, 1973) had not inccrporated the entire synoptic 
Pomeiung fleld. 

PieetneeayNOptic Clow Surreunding a tropical cyclone is 
a mMajcr determinant of cyclone movement nas been long 
observed (Chan and Gray, 1982). In particular, it has been 
well established that tropical cyclone movement is signifi- 
cantly related to mid-tropospheric surrounding wind patterns 
(Chan et al., 1980). Neumann and Lawrence (1975) associated 
most of the variance reduction by statistical models for 
prediction of tropical cyclone movement with input fron 
three sources: (1) climatology and/or persistence; (2) some 


type of "steering"; and (3) the position and intensity o£ 
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the synoptic-scale features adjacent to the trovical 
Cyclone. One approach is to consider the cvclone to bea 
point vortex whose direction and speed are approximated by 
the direction and speed of the surrounding winds (02, eqguiige 
alently, the pressure or height jradients across the 
cyclone). The steering level is that pressure level at 
which the wind speed and direction best correlate with those 
of the cyclone. The steering level theory has been applied 
in several tropical cyclone movement forecast schemes; for 
example, Riehl and Shafer (1944), Miller and Moore (1960), 
Tse (1966) and Renard et al. Cioeiee Different steering 
levels are used by the various forecast schemes. However, 
the general concensus is that the mid-tropospheric levels 
(700 mb and 500 mb) are the best for predicting tropical 
cyclone movement (Chan and Gray, 1982). The upper tropos- 
pheric level winds have not been found to be useful for 
tropical cyclone movement prediction (Jordan, 1952; Miller, 
io sone 

Statistical regression eguations were developed by 
Shaffer (1982) to predict the zonal and meridional displace- 
ments of tropical cyclones at 12-hour intervals to 84 hours. 
Eof coefficients of the dependent sample were used to repre- 
sent the synoptic forcing in the eyuations. Forecast errors 
were competetive with other statistical methods. The 
average vector displacement errcer for an independent sanple 
waS approximately 17 percent smaller than the iong-tern 
average official JTWC forecasts. The best overall forecasts 
were obtained uSing e€yguations derived with 500 mb height 


data. For these eguations, the vector displacement forecast 


errors obtained ote the independent Sample were 
164 km (68 n mi), “3327 km "(76 nen and 513 km (277 7 aos 
for 24-, 48- and 72-hour forecasts, respectively. It is 


noted that a shortcoming of Shaffer (1982) was the smallness 


of the independent sample, but the study demonstrated that 
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an ECF-based reyression approach can provide ae simple, 
tom—-cost technique icr CE CeMer ron Or PEOoted I. Cyclone 
motion. 

The extent to which the surrounding flow can be used to 
predict tropical cyclone movement has been explored by 
Studies such as Shaffer and Elsberry (1982) andis a key 
motive for this study. The motion of a tropical cyclone is 
not determined solely by forces acting on one pressure level 
but rather by the mean wind flcw integrated through a deep 
layer and over a substantial area surrounding the cyclone 
(Miller and Moore, 1960). Because a Single steering level 
has not been established, these rejression studies involve a 
Single-~level model that is tested with predictors extracted 
on three different leveis. The primary purpose of the 
current study is to use analyzed wind fields to represent 
Synoptic forcing in a tropical cyclone movement forecast 
technigue. Both Shapiro and Neumann (1984) and Shatfer and 
Elsberry (1982) worked with geopotential height data. 
Because the wind fields are generally more representative of 
the flow in the tropics, it is hypothesized that: a study 
Similar to Shaffer's using wind data could result in further 
improvement of forecast ability. 

The techniques that have been applied are not new. The 
uniqueness of the new forecast scheme is the use of an EOF 
representation of the wind forcing in the prediction of 
tropical cyclone movement. This forecast method can be 
Besetibed as a Statistical-climatological tropical cycione 
forecast method which uses an EOF representation of the 
synoptic-scale wind forcing. 

Chapter II discusses the acquisition of data and the 
grid system used. The EOF methodology and analysis are 
described in Chapters III and IV. In Chapter V, the 
resultant equations from a regression analysis are used to 


develop a prototype forecast schene. Future storm positions 


1S 


are determined by regression equations for the orthogonal 
components of motion. Chapter VI addresses the important 
guestion of applicability of the model for independent data. 
The concluding Chapter VII contains suggestions for further 


research. 
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IIT. DATA ACQUISITION AND FLELD DEFINITION 

Wind data used in the present study are from the Global 
Band Analyses (GBA), which are operationally generated by 
the United States Navy Fleet Numerical Oceanography Center 
CANOC) . The GBA are produced cn a 49 x 144 Mercator grid. 
Ate 22.5 N or S, the grid mesh distance is 257 Kn. The GBA 
provide complete longitudinal coverage over latitudes 
me 956 S to 59.745 N. Grid pcints are always separated by 
2.5 degrees of longitude. Hcwever, convergence of the 
meridians causes the actual zonal distance separating grid 
points to decrease toward higher latitudes. Along the 
Memenern boundary of the GBA grid from 58.462 N to 59.745 N, 
the longitudinal separation of tne grid points undergoes a 
3.7 percent decrease. hiss SiOouLd.s NOt. be an important 
source of error given the inherent uncertainties of the raw 
datid « The GBA were available for the period 0000 GMT 5 
January 1975 to 1200 GMT 31 December 1983. Data were avail- 
aeme at 0000 GMT and 1200 GMT for the zonal and meridional 
wind components at the following levels: sSuriace, 
mommem>, 400 mb, 250 mb and 200 mb. It is noted that the GBA 
are missing for some dates and times at one or more levels. 

Data for western North Pacific tropical cyclones are 
available from the annual tropical cyclone reports of the 
JTWC. At six-hour intervals, warnirjg position, best track 
position, estimated intensity (maximum sustained wind speed 
and minimum surface pressure) are given, as well as fore- 
casts for 24, 48 and 7/72 hours. The JTWC annual reports for 
the years 1979 to 1983 were used to select the cases used in 
this study. To apply the technigue proposed in Chapter I, 
the following conditions for case selection were imposed. A 


Mame ical €yelone which matured to at ieast tropical storn 
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intensity (maximum sustained winds of 18 m/s (35 kts) or 
greater) must have been present west of the dateline. The 
JTWC warhing position at the times the GBA were produced 
must have been at a latitude less than 34.6 N to ensure that 
data were available for a sufficient latitudinal extent 
north of the cyclone center. Papacy, the GBA must be 
available for the zonal and meridional wind components at 
700 mb, 400 mh and=250 mb. 

A total of 1357 cases were found to meet the above 
criteria. Because of computation-time limitations subse- 
guently encountered, the initial data set was later reduced 
to 682 cases by random selection. These 682 cases comprised 
the data set from which the EOF functions were computed. 
However, all 682 cases were not Suitabdle for the regression 
analysis due to aninadequate history or future storm 
EeCcord: The selection of cases for the regression analysis 
will Ee described in Chapter V. 

A relocatable 527-point grid was defined with a fixed 
zonal and meridional separation of 277.8 km (150 nm m1). ee 
grid in Fig. 11S typical. There are 31 grid points westmme 
east and 17 south to noren- The horizontal resolution is 
twice that. of *Shafier ei s7Z with about 4 1/3 times the 
number of grid ports (527 viceaeZe.. The equidistant grid 
extends 8334 km (4500 n mi) zonally and 4445 km (2400 n m1) 


meridionally. The grid is moved for each case so that the 
trepicas cyclone center is always located at the 
(O70 ) Soiree Doin For each case, the zonal and meridional 


wind speeds at 700 mb, 400 mb and 250 mb were were extracted 
from the GBA onto the eguidistant grid using a bilinear 
interpolation method (on a spherical Earth). The warning 
position from the JTWC was used to locate the cyclone 
CentecLE-. 

Contours of the mean and standard deviation fields of 


the zonal and meridicnal winds at 700 mb, 400 mb and 250 mb 
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SeensckhOwnl an Figs. 1-6. The means and standard deviations 
were based on ali 682 cases. LomocioOwieal Fieioe Io, the 
Variability of the winds is iargest in the northeast quad- 
rant of the equidistant grid at all three levels. Because 
the variability of wind speed is not uniform throughout the 
grid, standardization of the winds by the mean and standard 
deviation at each grid point is essential to ensure that 
regions of the grid where variakility is generally higher do 
not "dominate" in an EOF analysis. The standardization of 
data wili be presented Chapter III. 

The mean zonal and meridional flow patterns at 700 ob 
appear to be physically reasonatle. The mean zonal flow in 
Fig. 1 shows easterlies (westerlies) to the north (south) of 
the storm center. Although the grid resolution does not 
reveal the fine structure of the storm, the cyclonic envi- 
ronment of the storm is evident. The mean meridional wind 
component in Fig. 21s dominated by southerly (northerly) 
Ilow to the east (west) of the storm. Again, the cy cionie 
Shear envelope of the storm can be clearly identified. The 
mean zonal wind fields (Figs. 1, 3 and 5) show significant 
strengthening Ff the westerlies north or the storm ‘from 
Poem b to 250 mb. The “Serong, positive meridional flow 
northeast of the storm at 400 mk and 250 mb (Figs. 4 and 6) 
could be an indication of a possible outflow channel, which 
kas been shown to be Favonuaeren TOF tLrooical cyclone 
intensification (Chen and Gray, 1984). 

miner low-level cyclonic and upper-level anticyclonic 
circulations in the mean wind fieids are generally represen- 
tative of mostly mature cyclones. It 1S recognized that 
computation of the mean fields was not restricted to cases 
for which the developing cyclone had matured to tropical 
storm intensity or to cases of intensifying cyclones. 
Nevertheless, Figs. 1, Ss anedeo amdicate ~ patterns of the 


vertical shear of zonal wind that nave been associated with 
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tropical cyclone maintenanmee (Gray. Ji272) The vertical 
shear of the mean zonal wind near the cyclone center is not 
large and changes Sign across the center. The . sheaneis 
positive to the poleward side and negative to the equator- 
ward side of the cyclone. Also, the line of zero Zoned 
vertical shear crosses near the cyclone center. 

These mean wind fields thus show that the GBA are 
capable of representing the flew around tropical cyclones. 
In the next chapter, the method of uSing EOFS to reoreseonmme 


this flow for all the cases in the Sample will be described. 
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tiie ooeineeaL ORTAECGONAL FUNCIIONS 


ee ce ee eee ee ee — <P ee ce ee ee = ee ce ee ee ee ee -e 


A. BACKGROUND 


The general application of é€igenvectors in an EOF anal- 
ysis is similar to the representation of a field in terms of 
orthogonal functions. While orthogonal functions are gener- 
ally simple functions such as sines and cosines, eigenvec- 
tors are derived from the data fields. After suitable 
ranking, a few eigenvectors may represent a Significantly 
nigher proportion of data variance than would the same 
Number of orthogonal functions. The statistical methods 
known aS principal ccmponent analysis and empirical orthog- 
onal function analysis (also referred to as empirical eigen- 
vector analysis) are in essence the same. The principal 
components are the same coefficients that would be derived 
meom dn FOF analysis. 

The EOF analysis is an objective, mathematical procedure 
which starts with éither the correlation or covariance 
DeameetX Of the original data matrix. Pron tiene sOSsS—pLoduce 
matrix, the eigenvalues and eigenvectors are derived. the 
hormalized eigenvectors form a complete orthonormal basis of 
vectors which can be used to refresent the original observa- 
tions. It will be shown that the reiative magnitudes og the 
elgenvalues can be used to frank-order the eigenvectors 
(modes) in terms of their significance in representing the 
Gaal. Furthermore, the most significant eigenvectors (that 
is, those which represent the greatest percentage of vari- 
ability in the data) can often te identified with physically 
important patterns in the original data. ‘While not impor- 
tant for this study, it 1s noted that data containing recur- 


rent temporal variations have Spatial eigenvectors whose 
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expansion coefficients are a time series representation Of 
these temporal patterns (Hardy and Walton, 1978; Legere 
To OSes 

Examples of the use of eigenvectors (eigeniodes) in 
meteorological applications include tnaose of Lorenz (1956) 
in statistical weather prediction, Grimmer (1963) in van 
analysis of temperature patterns in Europe, and Mateer 
(1965) ain an analysis of observations of ozone distribution 
from sky-light intensities. Many other studies can be found 
in the meteorological literature. Hardy and Walton (1978) 
gave a broad survey cf possible applications of the analysis 
GF "scalar data. The mathematical details of the scalar EOF 
method are described in the next section. 

There are particular advantages afforded by an EOF data 
analysis. It is not necessary for the data to be stationary 
(in a statistical sense), nor do they have to be uniform 
Sampled in space or time. The EOF method is a convenient, 
cost-effective and objective means to represent large 
amounts of synoptic data by comparatively few coefficients. 
While numerical storage is nct normaily a problem with 
modern computers, it is important that the researcher be 
able to represent synoptic fields in a "compact" manner. 
Also, these coefficients can ke readily incorporated into a 
regression analysis. Kutzbach (1967) sives a partéiculame 
clear description of an EOF analysis that was used to reduce 
23 temperature observations at 25 grid points to £lve Gigs 
vectors which accounted for 88 percent of the total varia- 
tion. Similarly, Stidd (1967) ferforaed an ZOF study of the 
average monthly rainfall in Nevada and was able to account 
for 93 percent of the total variance using only three eigen- 
vectors and coefficients. The elgenvectors were success-— 
fully associated with factors related to rainfall. These 
examples demonstrate the effective use of EOF analysis for 


data beducrien and for possible IGentl& ican om 
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meteorological forcing patterns. Because of their inherent 
empirical nature, it is not reyguired, and hence not always 
found to be the case, that the eigenvectors have a physical 
interpretation that accounts for any variation of the field 
being analyzed. 

Application of the EOF methcdology to wind data has been 
done in various manners. PCEpmgexanple;s sarnett (1977) 
applied an ECF analysis to Pacific trade wind data separated 
into zonal and meridional compcnents. The result was an 
analysis of two separate scalar fields. Alternately, the 
treatment of the wind as complex numbers for the EOF tech- 
nique was presented by Hardy (1977). Hardy and Walton 
(1978) analyzed mesoscale wind vector measurements at ten 
Peaetons in tke San Francisco Bay Area. This report 
included a useful mathematical appendix describing the anal- 
ysis of complex (that 1s, vector) data, since HOF analysis 
or two-dimensional vector data is achieved by use of complex 
rather than real numbers. This extension of the nethodology 
tomeseralghtiorward. The tiwe series analysis of the 
temporal component patterns was also illustrated. Results 
OF) this study confirm that ECF analysis can be advanta- 
geously applied to large sets of freyional wind velocity 
etd! The method objectively derived the essential spatial 
and temporal properties represented by the data, and enabled 
a quantitative development of "prototype" cases and a guan- 
titative comparison of regional velocity patterns on a 
iomien=-tO-month basis. This 1S Similar to the application oz 
PEOtesanalysiS LOr map typing. Fcr example, Brown (1981) used 
EOF methods to divide height fields surrounding tropical 
cyclones into smaller classes Eased on the derived coeffi- 
cients. These classes were used for an analog scheme to 
forecast tropical storm movement. 

Legler (1983) applied the method of Hardy and alton 


{1978) to 18 years of mnonthly-average recorcs of tropical 
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Pacific wind vectors. The complex SOF results linked the 
spatial and temporal patterrs of the data fields. This 
fusion of Space-time variations iS particularly useful for 
long-term records over large sfatial areas. The temporal 
variance of tae data was partitioned into orthogonal spatial 
patterns (the eigenvectors). The complex coefficients 
computed were shown to be a time series modulating the 
eigenvectors which were associated with physical patterns 
(Signals) that accounted for a large percentage of the total 
variance. Legler further demonstrated that it is possible 
to obtain statistical informaticn that could not be ootamied 
using a scalar analysis of the wind components. 

Whether to perform a scalar or a vector SOF analysis is 
a fundamental consideration. Kjelass (1971) has edited 


several articles on the theory and methodology o£ scalar and 


vector EOF analysis. These articles include illustrative 
examples of the applicatlon mos the methodology if 
geophysics. For vector data, a more realistic representa- 


tion would be expected from a vector EOF analysis, as dénon- 
strated ty Legler (1983). However, it would be rasn to 
assume that a vector analysis is necessarily best for vector 
data. As discussed in Chapter IV, the meridional and zonal 
wind components compriSing the data for this study were 
subjected to a scalar EOF analysis. The mathematical pree. 


dure for this analysis is described in the next section. 


Be THE EOF METHOD 


Let A be an m@ X N matrix containing n cases of m-variate 
data. The following development will be for the scalar EOF 
analysis of the standardized data matrix 2Zwith elements 


Z(i,j3) defined by: 


Z(1.)) > f 42{173) 93>) 7S ’ (3.1) 
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where a(i,j) is the corresponding element of matrix A, and 
b(i) and s(i) are respectively the mean and standard devia- 
tion of the elements in row i of matrix A (that is, the mean 
and standard deviation at a DabciciidimmnpOtnt “OL "tne 
Pela@cistant Ghid computed over ail cases.) The elements of 
Z are dimensionless variates of zero mean and standard devi- 
ation one. The main advantage of using standardized data is 
that it erfectively treats the systematic variation in 
Magnitude of the elements of the data matrix A. Dhl es 
kenefticial for the reasons given in Chapter II. The same 
Systematic error can occur with the use of the covariance 
Matrix, which also introduces the need for dimensional 
Mmemerng tO Lbeturn fo the form of the input data prior to 
interpretation of the eigenvectors. A disadvantage of using 
Wm@emeOrrelation Watrix 1S potential, but slight, smoothing 
Semone results (Kutzkach, 1967). 


The correlation matrix (R) then is the symmetric natrix: 


Pees /n p (332) 
where n 1s the number of cases, and a prime is used to 
denote the transpose of a Matrix or vector. Next, it is 


necessary to determine the follcwing constrained maximun: 


Max f{y: ete = 1} where y = e'Ffe (354) 
for the mu dimensional column vector e. The scalar y is the 
correlation between vector e and the data matrix A. The 


constraint requires that the vector e be normalized to 
length one. Morriscecn (1967) applies the method of Lagrange 
Hee oL1ers to (3.3) to obtain: 


ene ec. f— 1-O ’ (3.4) 
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where v is the Lagrange multiplier, i the identity mare 
and o the null vector. Wonutrivial solution of (a8 


requires v to satisfy: 
}R - vi] = O . (3259 


The values of v are thus the eigenvalues of the correlation 
Matrix RK, and e is the associated (normalized) eigenvector. 
Premultiplication of (3.4) by e' and ~a0plication ora 


constraint ete = 1 from (3.3) gives: 
VC Comme (3.6) 


Since v was chosen to maximize this correlation, v must be 
the largest eigenvalue of R. Morrison (1967) extends this 
constrained maximum method to show that the m eigenvalues of 
R account for the variance in eack of the m dimensions. en) 


the following discussion, the eigenvalues are ordered such 


Di pede 3 (3 28 


Also, the importance of the ith eigenvalue is measured by 


™ 


R 
Lp = 2; / Pa 


we 
2, / trR , (3.8) 


where Le is the fraction of the total variation Sing 
accounted ror by the eigenvectors associated with the k 
largest eigenvalues. The trace of the correlation matrix 
(tc R) 1s equal to its order (nm). 

Any of the input data cases (stored ina particular 
column of A) is "reproduceable” by application “of the meem 


coefficients defined by: 
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ef A ‘ (3-9) 


mere CG 1S an @ X Nn Orthogonal matrix and Eis the axon 
orthonormal matrix of the eigenvectors. Waterless) 1S formed 
such that column j hclds the ncrmalized eigenvector associ- 
ated witk eigenvalue j. Since E is orthonormal, (3.9) gives 


directly that the data matrix A can be recreated as: 


A= EC. (3510) 


fide, the EOF analysis results in a factorization (3.10) of 
the data matrix A. Matrix E of elgenvectors represents the 
spatial decomposition of the data variance into orthogonal 
modes. The coefficient matrix C accounts for the temporal 
variance. 

Phe replication of the data matrix Ais exact. The 
potential for application of the analysis with independent 
data is discussed in Chapter IV. It 1s noted that exact 
Heproduction is not jfossible fcr cases not in the develop- 
mental set of cases. Such a recreation would not be 
possible using a finite sum of functions of an orthogonal 
family. Case j (stored in column j of matrix A) 1s repre- 
sented by a linear combination of the orthogonal coeffi- 
clients and eigenvectors: 


mn 
a(j) = a c(1i,j) ee (3) BO ieee s 5 A ? (35° Vay 

where a(j) 1s the column vector j of matrix A, the c(i,j) 
are elements of the coefficient matrix C and e(di) See iS 
eugenvector in column i of matrix ¢. 

A word of caution should be given here. Dive *ractoml: za 
tion (3.10) is unique up to the coefficient signs since the 
coefficients are computed such that the variance is parti- 


tioned orthogonally into successively smaller portions. 


Zi, 


This uniqueness results Since the partitions £ormed eae 
distinct. The researcher might pe tempted to use orthogonal 
(matrix) ransformations on the coefficient matrix in an 
attempt to simplify the interpretation oc the Subjeer 
matter. The transformed matrix will generate the original 
data just as exactly as before; however, the eigenvectors no 
longer represent the same maximum percentages of variance. 

Lis, as generaliy found that an adequately large 
percentage of the total variaticn in R (and hence in A) can 
be attributed to the first p eigenvectors such that pis 
much smaller than the total rumber of eigenvectors (nm), 
particularly when m is darge '~(toneisony 7677 Case j is 
then approximated by: 


LP 
a(j) = O.cli,jpee(} for j= 1, .-- MN - (3. 12) 
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It is possible to recreate the input data elements of matrix 
A from the standardized matrix Z. If the first p eigenveds 


tors are retained, then (3.11)1i¢ approdzimated by: 


or 
a(i,3) = 2 [ etka) eae ce eran ee (3. 13) 
k=| 
where the e({i,k) are elements of the eigenvector matrix &. 
Shaffer (1982) discussea the rotation of eigenvectors 
computed in an EOF analysis. He gave a very simple 


example cf rotation and contrasted orthogonal Cotation woe 
obligue. The possibility that unrotated eigenvectors may not 


represent the true synoptic patterns was also explored and 


evidence given that this should not occur for true 
geophysical synoptic fields. Rotation was not performed on 
the eigenvectors in this study for several reasons. First 


the eigenvectors were needed to generate the coefficients to 
be used for the regression analysis. As such, the ability 


to interpret physically the eigenvectors is not as important 
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as in studies in whicn this iS a major objective. Gas 
example, Legler’ Ss (1933) auld) Opmetne trOolGal Pacific 
trades showed that rotation of the resultant eigenvectors 
can be essential to interpreting the patterns as well as to 
simplifying the statistical analysis ot the data. Second, a 
goal of this study was to reduce the data required for fore- 
casting. This was done by analyzing the amount of variance 
accounted for by the various eigenvectors. Were the eigen- 
vectors to have been rotated, they would no longer account 
tor the same percentage of the total variance. BOE -fLubtnes 
discussion cn the rotation oz €igenvectors, the reader is 


referred to Richman (1981). 


Seo lL ECTING THE NUMBER OF EIGENVECTORS 


Ohne important advantage of the EOF technigue is that of 
summarizing most of the variation ina multivariate systen 
in terms of fewer variables. Uniess the system is defective 
(less than full rank), some variance will always be unex- 
plained if fewer than mn, the row dimension of the data 
Matrix A, are taken to descrite the systen. The problem 
faced by the model builder is to determine the number of 
eigenvectors to provide a parsimonious, yet fairly adequate, 
description of a data systen. Various methods have been 
applied to determine how many é€igenvectors are significant; 
inate 12S, possess maximum infcrmation with minimum noise. 
The classical methodology outlined by Morrison (1967) is 
based upon the asymptotic behavior of the eigenvalues. This 
approach operates on the assumption of a large sample of 
normal data. If standardized data are utilized, the 
sampling statistics are considerably more complex (Anderson, 
eo3) . Preisendorfer and Barnett (1977) observed that this 
method 1s generally not suited to geophysical studies in 


which sample sizes are too small to have the reguisite 
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asymptotic behavior. Shaffer (1982) Found the asymptotic 
assumption to be invalid for his study of 504 cases (wath 
120 data “points seach of geofotential heights. Another 
alternative is to use the LEV (Logarithmic EigenValue) 
diagram method (Rinne and Karhila, 1979) which identifies 
those structural differences of the eigenvectors that 
describe noise instead of signal. Although this method is 
Simple, it is unsatisfactory because of the Subjectivaley 
required on the part of the researcher andthe lack of a 
strong theoretical basis. Other methods such as’ those of 
Richman (1980) or Brown (1981) are also rejected because 
they are too subjective in their applications. Methods such 
aS presented by Cattell (1958) and Guttman (1954) are 
considered unsuitable because of the danger of prokable 
overfactoring and their lack of a Scientific basis. 

The method used in this study 1s a Monte Carlo approach 
(Preisendorfer and Barnett, 1977). This approach was chosen 
over Morrison's (1967) because: (1) ait does not remit. 
asymptotic behavior of the eigenvalues; ({2) it is objective; 
and (3) it is based on statistical methodology. The f1rse 
step in this method is to generate at random a large number 
(at least 100) of data fields consisting of standard nornal 
deviates, which are then assembled into a matrix Z. Mat ris 
Zis therefore assumed to represent a data matrix obtainable 
if all processes are purely randon. Next, the eigenvalues 
are computed for each matrix Z2 Means and standard devia- 
tions are determined for the simulated eigenvalues. The 
eigenvalues obtained from the physical data are compared 
with those from the simulated deviates. If the true eigen- 
value deviates from the mean cf the corresponding randon 
data eigenvalues by more than two (three) Standard devia- 
tions, then the true eigenvalue is significant at the 95 
percent (98 percent) confidence level (Preisendorfer and 


BarnetG, 1977). That is, deviation of the true eigenvalue 
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from the mean Simulated eigenvalue by at ieast two standard 
deviations is indicative that the associated eigenvector 
represents signal rather than ncise. Bow SUuCecessiVve COL Ti- 
cients are computed, a running sum (using (3.12) or (3.13) 
aS appropriate) can be formed and compared with data matrix 
A to determine how well the data matrix is being generated 


by a smaller number of modes. 


on 


IV. RESULTANT EMPIRGTCAL SORT EOCU wea Ue LoNs 


As STATISTICAL AWALYStS 


The mathematical and theoretical framework developed in 
Chapter III was used for a scalar EOF analysis of the depen-= 
dent data set (682 cases as described in Chapter II). The 
Major purpose of this phase of the data analysis was to 
compute the EOF coefficients needed for the tropical cyclone 
motion forecast scheme proposed in Chapter I. Since these 
EOF coefficients were needed for use as possible predictors 
in separate regression equations for zonai and meridional 
storm movement, a scalar rather than vector representation 
waS considered to be adeyuate. For each of the zonal and 
meridional wind fields at 700 mab, 400 mb and 250 mb, a 
527 x 6382 data matrix A was fecrmed using the interpolated 
fields as columns. A matrix Z of standardized data. 
computed for each matrix A, and the resultant eigenvalues 
and corresponding eigenvectors were determined. For each 
wind-component field, 527 modes (eigenvectors) were gener- 
ated. The EOF coefficients for each of the 682 caSes were 
also computed for each of the six wind-component fields. 

The eigenvalues and cumulative percentage of total vari- 
ance for the zonal and wmeridicnal fields are presented, 
pressure level in Takles [i throughea The eigenvalues, 
and hence the significance cf their associated modes, 
decrease rapidly with increasing mode number. Zonal—iekse 
eigenvalues decrease at approximately twice the rate of 
decrease of the meridional eigenvalues. 

Although many modes resuited because of the number of 
grid points per case, most of the higher order modes repre- 


sent noise Bather than sigqnaa. To determine the number o£ 


a2 


modes to be retained, a Monte Carlo simulation was run as 
S@esecribed in Chapter III. A random number generator for 
standard normal deviates was used to simulate 100 standard- 
teed) 52/7 x 682 data Matrices Z. The Statistical "structure" 
of these random fields parallels that of the standardized 
fields of the real data. For each of the 100 simulated 
matrices of 682 cases of randcm standard scores, the EOF 
analysis was performed to yield 100 sets of 527 eigenvalues 
(one per field grid point). The means and standard devia- 
tions of the Monte Carlo eigenvalues were computed. If the 
elgenvalue for a mode computed from the real data was 
greater than the corresponding mean eigenvalue fplus twice 
its standard deviation as computed from tne random data, 
then the eigenvalue and eigenvector from the real data were 
selected as representing atmosfheric signal. The corre- 
sponding mode was retained at the 95 percent confidence 
level. Table V contains the mean eigenvalues of the Monte 
Carlo simulation as well as these mean eigenvalues plus 
twice their standard deviation. 

Comparisons of the six sets of real-field eigenvalues to 
those of the random fields are performed separately since 
the number of significant eigenvectors may be different for 
each level. The only relationship between the modes of the 
Six fields for the three levels comes from any vertical 
coupling that may exist. Fig. 7 illustrates the eigenvalues 
Semeeene 700 mb Zonal wind field and the Monte Carlo sinula- 
tion for the first 40 modes. Twenty-four modes are indi- 
cated to represent signai. Table VI is a summary of the 
humber of modes to be retained and the percentages of total 
variance described according tc the Monte Carlo selection 
emi terion. Some general observations can be  anade. FOr 
either the zonal or the meridional flow, the number of modes 
that represent signal at 700 mb is less than that at 400 mb, 


Wien If turn is less than that at 250 mb. FOr anye@om, tive 
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levels analyzed, a smaller numker of zonal modes than neri- 
dional are retained witn a higher percentage of total vari- 


ance represented. 


B. INTERPRETATION OF RESULTS 


The percentages of variance unexplained (noise) ts )ream, 
istic for a tropical) wind anaes: Errors are largely due 
to data distributions of measurement errore. The analysis 
problem is difficult because of the weak governing mass-wind 
balance relationship in the trcpics (Haltiner and Williams, 
1230) Therefore, it is plausible that the level of randon 
error in the wind-ccmponent fields is as high as 18.3 
pereente This maximum percentage of "noise" (for the meri- 
dional wind fields at 700 mb) corresponds to the largest 
number of modes (35) selected tc represent "Signal". 

In the subSeguent regresSion analysis, only the first 35 
modes of the zonal and meridional wind fields will be used 
in the development of the corresponding zonal and meridional 
storm mevement equations for é€acn of the three pressure 
levels analyzed. The retenticn of 35 modes for each wun 
component field provided the maximum possible selection of 
modes without including unnecessary noise. Using only ie 
coefficients out of 527 is a remarkable data reduction of 93 
percent. For each field it 1S necessary to store only the 
eigenvector matrix E and the first 35 coefficients for each 
case, Which will account for no less than 81.7 percent o£ 
the total variance. Table VII lists the percentages of 
variance accounted for when 35 modes are retained for all 
wind component fields. At least 90 percent of the total 
variance of any zonal wind field is accounted for. While 
the number of EOF coefficients needed is much larger than 
the 10 per case in Shaffer (1982), 35 modes per field is 
Still a tractable number of potential predictors for regres- 


Sion analysis. 
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Peet omepenetictal tem investigate the physical signifi- 
cance of the modes determined tc represent Signal. Shaffer 
(1982) found that the broad-scale features of eigenvectors 
derived from geopotential height fields had meteorological 
meaning. Contours of modes one and two (multiplied by 100) 


for the 700 mb zonal and meridional fields are presented in 


medica co and 9. The eigenvectors are non-dimensional, since 
standardized data were used for the EOF analysis. Two 
points must be stressed. Rese, there 1S no mathematical 


connection between any zonal-field mode and the same mode o£ 
the meridional field. That is, it is not possible to regain 
the vector nature of the wind Ey a combination of zonal and 
meridional eigenvectors. Secord, each eigenvector repre- 
sents the pattern shown as well as the exact inverse of the 
Paieeern.sc FOL a given field, the forcing pattern of a parti- 
cular eigenvector is dependent upon the sign of the associ- 
ated EOF coerficient. If the coefficient 1S nejative, then 
the forcing pattern of the eigenvector is "inverted". 
Peeve (Negative) Components or the field are reversed to 
negative (positive). The following discussion will use 
€lgenvector patterns aS shown without considering the 
inverse patterns. 

mre patterns of the 700 mb rodes 1 and 2 in Figs. 8 and 
9 can be interpreted separately as possible atmospheric flow 
patterns. Mode 1 of the 700 mb zonal flow (Fig. 8) shows a 
cyclonic Shear across the cylone, With easterlies to the 
north of the cyclone and westerlies to the south. Mode 2 of 


the 700 mb zonal flow (Fig.8) is dominated by broad easterly 


TOW The zOnal modes 171 and 2 at 400 mb and 250 mb 
(not shown) are characterized by diminished equatoriai 
westerlies to the south of the cyclone. Modes 1 and 2 of 


the 700 mb meridional flow (Fig. 9) both show alternating 
bands of positive and negative flow. These patterns are 


typical for trough-ridge-trough arrangements. Speed maxima 
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in the Eands are located north ch the veveron The cyclone 
is again located in a reyion of cyclonic shear. Meridional 
modes 1 and 2 at 400 mb and 250 mb {not shown) depict the 
eastward slope of the 700 mb patterns with elevation. These 
modes, which individually acccunt for the Targest peneaaee 
ages of total variance in their corresponding fields, 4£e 
indeed patterns or Signals that appear to relate to tropical 
cyclone movement. 

Complexity of the eigenvectors made it difficult to 
associate ofservable atmospheric patterns with higher order 
modes for any of the fields. Legler (1983) has observed 
that examination of the eigenvectors to give appropriate 
physical interpretations May — be  abepedetaca for @anaa 
collected over large grids. Over large areas, signals from 
two or more physical processes may be overlaid ina single 
elgenvector. This can occur since there are no restrictions 
as to how the patterns for a particular process may be 
"iecomposed" among the eigenvectors. Moreover, particularly 
strong atmospheric signalS May appear in more than one 
elgenvector. Under such circumstances any realistic inter- 
pretation of the modes may be precluded. 

It is also important to verify that the significant 
modes selected for retention do satisfactorily represent the 
data fields. A case (0000 GMT 30 July 79) was selected at 
Fandom to demonstrate the tLreccnstruction Capability soem. 
EOF analysis. At this time, Typhoon Hope was at approxi- 
Mately 16.9 N, 133.4 E with maximum sustained winds o£ 
38.6 m/s (75 kts) The actual zcnal wind field at 700 mb and 
the reproduction by summing all 527 modes are shown in 
Pig Ore The reproduction of the original field is seen to 
be exact. If the eigenvector matrix were to be used to 
generate coefficients for.a case not included in the depen- 
dent sample, the reproduction froduced by summing over ali 


modes would not be exact. The fields obtained by summing 
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mmeomerinse 5, 15, 25 and 35 are shown in Figs. 11 and 12. 
When only five modes are summed, only the gross patterns 
(positive flow versus negative) are reproduced. Yet, lies 
interesting to observe how only five coefficients and modes 
can begin to recreate a particular field using eigenvectors 
derived from all 682 cases. As the number of modes 1s 
increased, anh increasing amount of the complexity of the 
original field is rerflicated (Figs. 11 and 712). In the next 
Chapter, the EOF coefficients derived for the zonal and 
meridional wind fields will be used as potential predictors 
representing the synoptic-scale forcing in a stepwise 


regression procedure. 


a 


Vo REGRESSION ANALYSES 
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A. MOTIVATION 


Regression analysis is one of the most widely used 
Statistical tools; Its essence is the study of relation- 
ShipS among variables’ to serve three major purposes: 
description, control andepredreccion, The researcher's goal 
is to find a simple mathematical model that, on the basis of 
observed data, will fit a complex phenomenon. An excellent 
presentation of theory and methced that is conducive to prac- 
tical application is given by Neter and Wasserman (1974). A 
more advanced presentation of statistical theory of the 
complete general linear model is given by Graybill (1976). 
Briefly, regression analysis involves using a linear combi- 
nation of known quantities (predictors) to estimate the 
value of an unknown guantity (predictand). 

EOF coerficients have been demonstrated to give a 
convenient, quantitative representation of physical forcing 
mechanisms acting on tropical cyclones (Chapter 4) 
Previous studies (described in Chapter I) have shown that 
Statistical forecast schemes based on regression equations 
acre viable methods. in? partieular, it is possible to use 
EOF coefficients based on geopotential heights as predictors 
to forecast tropical cyclone movement (Shaffer and Elsberry, 
1982; Shapiro and Neumann, 1984). The hypothesis here is 
that the SOF coefficients derived to represent wind forcing 
of a tropical cyclone would be useful predictors of uri 
storm rzovement. 

Western North Pacific tropical cyclone position tonecea an 
errors for 10 years {1966=1975) have been statistically 


analyzed=(garcell et ale 1g The examination o£ errors 


30 


revealed that a small numkter of readily available parameters 
can classify, With reasonable effectiveness, a tropical 
cycione forecast aS representing a group of storms with 
elther markedly above or pelow average errors. These vari- 
ables include storm location, maximum Sustained wind and the 
components of motion. Thus, it is hypothesized that these 
parameters might also be appropriate predictors of tropical 
cyclone movement. Regression analyses were performed to 


investigate these hypotheses. 


Dee VARIABLE AND CASE SELECTION 


A primary goal of any reyression analysis 1S to choose a 
set of independent variables that is "best". Here the 
criterion "best" is defined aS Minimizing the sum of syuares 
Sie residuals without overfitting. Practicality requires 
that there be a scope of the model; that is, the coverage of 
a model is restricted to some region or interval of values 
of the independent variables. Model coverage is determined 
by the dependent cases included in the analysis. Possible 
Gieeerculties are considered later in this chapter. 

Predictands for this study are the average 24-, 48- and 
72-hour zonal and meridional translation speeds of tne trop- 
ical cyclone. These average speeds were determined from the 
base-time JTWC warning positicn and the subsequent JTWC 
Warning position at 24, 48 or 7z hours. Positive motion was 
defined to the north and to the west, since the majority of 
tropical cyclones tracked to the north and west. As there 
are six predictands, six regression equations are required 
for each of the pressure levels included in the study 
(700 mb, 400 mb and 250 mb). A total of 18 eguations was 
derived. 

It is emphasized that the predictands were computed 


uSing JTWC warning fpositions at both base time and the 
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forecast times. In the subseguent discussion, comparisons 
of the results obtained in this study for tne dependent data 
are made with those obtained for the dependent data in 
Shaffer's (1982) study based on an EOr analysis of geopoten- 
tial height. Shaffer's predictarnds were computed using the 
JTWC warning and best-track positions at base and forecast 
times respectively. 

Predictors were sought to assess quantitatively the 
effect of five factors on tropical cyclone movement: 
(1) external (to the cyclone) physical forcing; (2) previous 
cyclone movement; (3) cyclone intensity; (4) date; and (5) 
Lie tad ee(Wa rl 2G) Ost tome Table VIII describes the 83 
potential predictors used for the regression analysis and 
identifies these predictors by name and number. The poten- 
tial predictors were identical for all 18 regression egua- 
tions, except that the regression ejuation for a specific 
level included only the EOF coefficients at that level. 

Synoptic external forcing on a tropical cyclone has been 
conjectured to be an important determinant of cyclone move- 
ment (Brown, 1981; and others). To incorporate Guantita- 
tively the wind ftoremng, the EOF coefficients associated 
with the first 35 zonal and meridional modes were selected 
as potential predictors. These coefficients are potential 
predictors 14 through 83 (CU1 through CU35 and CY? thrown 
CV35)— Vim table Vili. An important objective of this study 
waS to evaluate how well these EOF coefricients represented 
atmospheric features that affected cyclone movement. 

Persistence has long been known to be a good predictor 
of short-term tropical cyclone=nocton. Therefore, hine 
potential predictors representing past zonal and meridional 
motions were included. These were variables 5 through 13 in 
Table VIII. Each prior average speed or vector displacement 
was based on JTWC warning positions to simulate operational 
conditions. 
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MiemEeccente MNetLhon Of the Storm 2S an integral part of 
the prediction model of neariy all tropical cyclone forecast 
methods. Most cyclones move with uniform direction and 
Speed (Gray, Neo TO.) SdimibaecderOhy | LOLeCcCaStS OL tropical 
cyclone movement can be based mainly on extrapolation and 
mEamatclogy. Because there are relatively few storms with 
anomalous tracks, predictors based on present and past nmnove- 
ment tend to dominate a statistical analysis of storm 
motion. These "difficult" storms, which are associated with 
above-average forecast errors, tend to recurve or to move 
erratically with nonclimatoloyical tracks. A persistence- 
@eemmatology rorecast leads to large errors for the 20-25 
percent of the cases of anomalous motion (Gray, 1978). When 
there are not many storms during a season, a Single anona- 
louS storm can result in a sSignhificart bias of the yearly 
mean forecast error (Neumann, 1581). 

The remaining potential predictors are related to obser- 
vaticns of the tropical cyclone at base time. Tio pisea: 
cycione intensity (potential predictor 4, Table VIII) was 
the JTWC warning maximum sustained wind speed at base time. 
The Julian date and the JTWC warning position latitude and 
Hemmeartude {potential predictors 1, 2 and 3, Table VIII) 
completed the set of potential independent variables. 

The 682 cases from the EOF analysis were used to select 
toe cases for the regression analySis. For a case to be 
ancluded, a complete set of pctential predictors had to be 
maeeable. in addition to availability of the GBA, the JTWC 
reports had to be available at 12 and 24 hours prior to base 
time and at least 24 hours subsequent to base time. 
Similarly, selection of that case for development of regres- 
Sion equations for 48- or 72-hour forecasts required that 
JTWC warning positions be available at 48 or 72 hours, 


respectively. These reguirements decreased the number of 
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cases for regression analysis. Sample sizes were 409, 308 
and 232 cases for the 24-, 48- and 72-hour equatione, 


respectively. 


C. THE EQUATIONS AND ERROR ANALYSIS 


A linear stepwise regression analysis was chosen to 
derive eguations to fredict future average zonal and meri- 
dional speeds of the tropical cyclores. Although an Wa 
priori assumption of linearity could not be made, the number 
of polyncmial predictors generated from a base set of 83 
potential predictors would have been antractabile The UCLA 
biomedical computer program BMDE2R (Dixon and Brown, 1979) 
was used for the regressions. Multicoilimedrs tye. which 
occurs when some or allof the independent variables are 


highly correlated (Neter and Wasserman, 1974), was avoided 


by the use of stepwise regression. Multicollinearig 
fosters a large potential for overfitting since Many 
different models would provide the same good fit. Asa 


resuit, it becomes impossible to interpret any one set of 
regression coefficients as béing representative of the 
effects of the different independent variables. Also, the 
estimated regression coefficients usually have a very large 
Sampling variability so that they are imprecise and lose 
their meaning (or Significance). The BMDP routine inciudes 
a preset tolerance to automatically screen the variables at 
each ster. A potential predictor was not allowed to enter 
the model if it was highly ccrrelated with any predictor 
chosen in earlier steps. To ensure that a predictor was 
Significantly (in a statistical sense) correlated with the 
predictand, a minimum F-to-enter value of 4.0 was imposed 
(Dixon and Brown, 1979). 

The coefficient of multiple determination (R?2) is a 


measure of the association between the dependent and 
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imdependent variables in the regression model. This 


Statistic is defined: 


ore eon 7oS lO eet ( So5E/7SSTO ) ; (S905 


where SSTO is the total sum of Sguares, SSR is the regres- 
Sion sum of sguares and SSE is the residual sum of Squares. 
The R2 statiStic measures the proportion of the total varia- 
tion in the predictand associated with the use of the inde- 
pendent variables. The regression equations retained only 
those predictors which resulted in an increase in R? of at 
least 0.01. 

The value of R2# for each regression equation iS given in 
Table IX. Matching forecast times and pressure ievels, the 
value of R2 for a zonal equation is always at least 0.12 
greater than the R2 for the meridional eguation for the same 
forecast interval and pressure level. Shafrer (1982) found 
differences as large as 24 percent. The zonal regression 
ea@laterons account for a Greater portion of the total zonal 
movement variation than the meridional eyuations. This 
observation agrees with Shaffer (1982). At least 59 percent 
of the total variation in zonal movement was accounted for 
by the eyuations at each of the three pressure tJlevels for 
any forecast interval. Values of R2 for the meridional 
Pemiations range from 0.325 for the 250 mb 72-hour forecast 
Pome. +75 for the 700 mb 24-hour forecast. 

The greater predictive ability of the zonal eyuations 
was expected. First, it was shown in Chapter IV that fewer 
modes were required to descrire the zonal wind than the 
meridional wind. Second, there 1s greater variation in the 
zonal movement than in the meridional movement. The means 
and standard deviations of the average zonai and meridional 
speeds of the various forecast intervals are given in Table 


Ke Positive mean zonal and meridional components with the 
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same magnitude indicates mean movement to the northwest: 
The values of R? for the zonal ejuations were significantly 
greater due to the larger varialility im zonal me ,cnenm 

Values of R2 do not vary greatly with forecast interval 
for either the Zonal or meridicnal equations at any Gime 
pressure levels. The largest deviations are for the 700 mb 
zohal equations and the 250 mE meridional equations. ie 
contrast, Shaffer's (1982) regression equations consistently 
displayed a significant decrease inthe value of R@2 with 
increasing forecast interval (about 0.1 per 12 heli 
interval). These differences in the variation of R2 with 
forecast interval may account for differences in forecast 
error characteristics discussed later in this section. 

Finally, the accuracy of the Zonal or meridional cada. 
tions 1S not a strong Function Cr) pmeaciec. For either the 
zonal or meridional movement, the equation derived using the 
EOF coefficients for a given level does not perform signifi- 
cantly better (as measured by R?) than the equations for the 
other two levels. This waS Similar to results obtained by 
Shaffer (1982) for the dependent sample. Slightly largess 
values of R2 are found for the 700 mb zonal eyuations at all 
three forecast intervals. 

Tables XI through XVI Summarize the regression equa- 
tions. The first value in each table 1s the intercept. The 
average speed component (Km/hr) 1S obtained by summing the 
product of all non-zero regression coefficients and the 
values of the associated variables. Parsimony in selection 
of variables was met; the main purposes of retention of as 
few variables as possible were to obtain simple eguations 
and = to: avoid voverrziae tines 

several observations were made regarding the variables 
retained for the regression equations andthe order of 
Selection: A past movement variable (predictors 5-10 in 


Table VIII) was the first variable selected in the stepwise 
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megcegure Lor 15 Of the 1¢@ equations, including all nine of 
the 24-hour forecast eguations. HOE eneomcondl. 4 O= 7.) ZOna. 
72- and meridional 72-hour forecast ejuations, the second or 
third variable selected was ror past moveazent. The 
Peed tetors UOLDZ, UOLL3, VOLDZ and VOLD3 (see Taple VIII for 
description) were most frequently chosen. These results 
were in agreement with Neumann's (1978) observation that 
statistical screening technigues invariably select present 
and past storm movement over steering predictors derived 
Paeme thie Surrounding £low for short-term tropical cyclone 
movement. However, the predictions are not simply persis- 
tence forecasts. Mode variables CU1, CUZ, CV1 and CV2 were 
often the second, third or fourth predictors selected. This 
waS not Surprising since the first 2 modes account for the 
ijargest percentages of variance In ere Snind=compor ent 
fields. P_OlpeeomeOulom ZONndh bCr COCLELCIeEnNE predictors and 
from 2 to 7 meridional EOF coefficient predictors were 
chosen for the forecast equatiors, so that wind forcing was 
also found to be an important determinant of tropical 
cyclone movement. 

Several potentiai predictors were not included in any of 
the equations: DATE, CINT, VOLD1 and DISP1. The potential 
predictor UCLD1 was retained in only one forecast eguation. 
These past movement variables represent the interval from 24 
tomers hours prior to base time. Very little information 
would be lost by exclusion of these potential predictors. 

The potential performance ci this regression forecast 
scheme was evaluated Fy testing on the dependent data cases. 
The following procedure was applied for the forecast inter- 
vals 24, 48 and 72 hours at each pressure level (700 mb, 400 
momemna 250 mb) . First, the appropriate equations were used 
to predict the average zonal and meridional speeds of the 
meopical cyclone. These speeds were converted to zonal and 


meridional displacements of the tropical cycione during the 
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forecast interval. Finally, the forecast Error was con ieee 
by determining the magnitude of the vector between the fore- 
cast poSition and the JTWC warning poSition az the comme 
sponding time. 

The forecast errors are Summarized in Tabie XVII by 
pressure level and forecast interval. It is stressed that 
these results were derived using only the dependent cases. 
As expected, the forecast errer increases with increasing 
length of forecast interval. Hcwever, the magnitudes of the 
increases are reasonable. The increase in the 72-hour fore- 
cast error over the 48-hour forecast error was much smaller 
than that s£0n°” °Shatter Ys aio) dependent Sample. The 
Smallest change for the current study was about 82 km less 
than for Shaffer's results. It waS previously noted that 
there waS arapid decrease in R@ with increasing forecast 
interval for Shaffer's equations. Shaffer's eguations 
predicted short-term movement well, but the errors grew 
rapidly with increasing time. The 24-hour forecast error 
for this study was about 25 ki larger than f0r Shaiteme 
dependent Sample. However, the best 48- and 72-hour fore- 
cast errors for the current study were 28 km and 90 km less 
than those of Shaffer. Stability of predictand variance for 
the current study resulted in models that give promise of 
improvement of long-term forecasts. 

There were no overwhelming dirferences in performance of 
the eguations derived for the three levels at any forecast 
interval. Shaffer's (1982) fcrecast equations based on an 
FOF anaiysis of geopotential height at 500 mb, 700 mb and 
850 mb also did not have significant differences in errors 
among the three leveis. dowever, Shafrer's 500 mb eguations 
outperformed the other two equation sets by a wide margin 
for a small set of independent cases. Although the 72-hour 
forecast errors in Table XVII are largest for the 250 mb 


equation, they still compare favorably with the JTWC mean 
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errors. These results do not appear to agree with Jordan 
(ho52) and Miller (1958) who were unsuccessful at using 
winds and heightS at upper trofospheric levels to describe 
tropical cyclone moticn. 

It was also important to examine the results for consis- 
tency in the forecasts. Consistency would be indicated by 
small standard deviations of forecast error. The standard 
deviations were generally comparable to Shaffer (1982). 
There were no significant differences among the standard 
deviations for a given forecast interval, except the 
standard deviation for the 72-hcur forecast using the 250 mb 
Pe@atron wWaS particularly smaller than that for either 


700 mb or 400 mb equations, 


D. CAUTIONS FOR USE OF THE REGEHESSION MODEL 


Varicus restrictions should be considered when applying 
the results of a regression analysis. The validity of the 
predictions depends upon whether basic causal conditions at 
later times will be similar to those in effect ror the data 
used for the regression analysis. The scope of the data 
must be respected to avoid inferences based on an indepen- 
dent variable which falls outside the range of input data. 
Finally, it must be remembered that the predictands used to 
derive the eyguations were computed using the JTWC warning 
positions at both the base time and at subsequent forecast 
times. 

The performance of the model as indicated by the depen- 
dent sample may be superior to the ability for new cases. 
Tnis is known as prediction bias, which results when the 
final model chosen is too uniguely related to the input data 
cases. It 1S emphasized that the models developed in this 


study have not been tested with independent data cases. 
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Multicoilinearity exists unless the variables (including 
the EOF coefficients) are completely pairwise uncorrelated. 
This rarely occurs acu When the independent vari- 
ables are highly correlated, the predictive ability of the 
model is suspect for new cases whose independent variables 
deviate from the pattern of multicollinearity in the depen- 


dent cases. 
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VI. POTENTIAL FOR USE WITH INDEPENDENT DATA 


Based on results obtained usiny dependent data and 
predictands derived using warning positions, EOF- based 
regression forecasting appears to have potential for 
improved prediction cf tropical cyclone movement. The value 
of the final model depends upcn its potential for opera- 
tional use with inderendent data. The regression eguations 
were derived using EOF coefficients computed using a farti- 
cular set of eigenvectors; nhnameéiy, the eigenvector matrix = 
of the dependent data set. These regresSion equations are 
applicable only for tropical cyclone cases within the scope 
of the nodel. The scope of the model is determined prinma- 
rily by the values of the predictors and predictands used to 
derive the forecast equations. EOF coefficients are the 
most sensitive predictors in that they are derived from the 
Pieerecular flow fields surrounding the tropical cyclones. 
FOr a new case, the eigenvectors no longer exactly represent 
the maximum variation in all of the observations--dependent 
set plus the new case. The stability of the eigenvectors 
must be examined by determining whether the eigenvectors and 
coefficients of the dependent data cases remain nearly tne 
Same if a new case is added. 

Inclusion of an additional case changes the correlation 


Matrix RK. The new correlation tatrix can be computed by: 


R(new) = [n/(nt+1) JeR(old) + [ 1/(n+1) Jezz'* P CG...) 


where R(new) 1s the new correlation matrix after addition of 
the new case, R(old) the original correlation matrix of the 
dependent data, n the number of cases prior to inclusion of 


the new case, and zthe m x 1 vector of standardized data 


4g 


for the new case. If n iS iarge, the term ({l/7n+1) an (Gee 
is negligible relative to the rirst term so that the 


following approxima tenes a ie 


R (new) ~ R (old) 3 (6 <5 


The eigenvalues and eigenvectors computed for the depen- 
dent data uSing R(old) should be almost identical to those 
obtained from computation using k&k(new). Provided that a 
sufficiertiy large dependent sample is available, it is 
reasonakle to use R(cld) to comrute the EOF coefficients for 
a new data case andthen to use these coefficients as 
predictors in the forecast equations derived with the depen- 
dent data. Shaffer (1982) determined that use of the coef- 
ficients for cases calculated using R(old) introduced very 
little error into the movement forecast. Testing is 


reguired to determine a sample size sufficient for (6.2) to 


be valid. The reader is referred to Saaffer for a detailed 
example On methodology aD EEODEiaiue to test these 
observations. 


Operational implementation of an EOF forecast scheme 


would be straightforward. TwO major operations are 
required. First, the 35 required EOF coefficiemts foreweme 
independent data cases must be computed and stored. This 


involves multiplication of the 35 x 527 transpose matrix of 
truncated eigenvectors and the 527 x 1 vector of standard- 
ized observations. It is assumed that no significant error 
would be associated with using the means and standard devia- 
tions from the dependent sample at the equidistant grid 
points. Second, these coefficients and other predictors 
would be substituted into the regression eguations to 
predict the average zonal and meridional speeds for the 
LOLCCa Ste peer ya le The predicted future location of the 


tropical cyclone could then be determined. 
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Assuming that the eigenvector matrix E was determined 
from an adeguate (large) dependent data set, the same set of 
eigenvectors can be used irdefinitely for independent 
cases (new tropical cyciones), within the limitations of the 


scope or the model. Shaffer (1982) recommended that the 


regression equations be updated at the conclusion of each 
typhoon season. The feasibility and necessity of updating 
can be questioned for the current model. Shaffer's cases 


reguired 120 data pcints per case as opposed to two fields 
Simeoz’ data points ¢€ach for this study. ir each case 
meeting selection requirements were added to the dependent 
data set, computing difficulties would be likely to beccne 
prohibitive after several tropical cyclone seasons. While 
it "might" be advantageous at least to include the anomalous 
cases, specific inclusion of ancmalous cases could seriously 
reduce the ability of the regression analysis to obtain a 
good fit. Shaffer (1982) also suggested that increasing the 
number of dependent data cases should result in fewer large 
forecast errors. However, a larger dependent data set does 
not imply a better fit (as measured by R?), nor does it 
imply that the model will better forecast anomalous cases. 
One alternative would be to have more than one set of 
regression equations. A map-typing or analog technigue 
could be used to determine which set of equations would be 
appropriate on a case-by-case basis. Suco an alternate 
method would lack Simplicity, which is one of the most 
attractive features of the current EOF-based regression 
forecast scheme. 

The forecast scheme using ECF-based regression models is 
very Simple compared with other more elaborate models. The 
model reguires only a set of ccefficients representing the 
Syhoptic-scale wind forcing and predictors representing 
present position and past storm movement. The entire fore- 


cast scheme could be executed using a minicomputer. The 


Su 


movement forecast could be generated upon input of the 
appropriate zonal and meridicnal components at the 527 
points. Operational implementation of such a statistical- 


climatological methcd appears tc be feasible. 
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CLUSIONS AND SUGGESTED RESEARCH 
The results described in this thesis must be regarded as 
preliminary. However, it apfears sufficiently promising 
that a viable, efficient regression scheme involving EOF 
coefficients to represent wind forcing can be developed. 
Two improvements are suggested before any operational 
testing might be performed. First, the predictands should 
be computed using the JTWC warning position at base time and 
the JTWC best-track positions at the predictand times. The 
best-track positions are based on a post-Season analysis 
uSing all information available. The use of warning posi- 
moms Lor the locations of the cyclone at predictand times 
unnecessarily contaminates the predictands. ites a pero 
priate to use the warning position to locate the tropical 
cyclone at base time because this is the only position 
available at the time of the fcrecast. Second, forecast 
error shculd correspondingly be defined as the deviation of 
eiew2orecast position £rom the kest-track position. 

Adoption of an operational forecast model reguires 
testing using both dependent and independent data. the 
EOF-regression forecast errors should be compared with fore- 
casts obtained by another operational model (such as CLIPPER) 
and of the JTWC. The ultimate utility of the model depends 
upon demonstrated forecast skill for operational data, 
regardless of prior performance on dependent data or indica- 
tions of a statistical Significance test. Resuits obtained 
in the present study indicate very good potential for an 


operational model. 


re 


The following discussion Suggests other possible 
research to improve the operaticnal modem 

1. The forecast scheme could be improved if other vari- 
ables representing physical features affecting storn 
movement could be identified and included in the 
regression equations. Intensity, represented by 
maximum sustained wind speed, was found to be an 
unimportant predictor on 2 bown this study and 
Shafter s ( 19ezie. Following Chan and Gray (19827)@ 
variables such aS the size of the cyclone should be 
tested in the regression analysis. Model verifica- 
tion of George and Gray's (1976) observation that the 
700 mb level Fest specifies cyclone speed and that 
the 500 mb level best specifies cyclone direction 
might be attempted. 

2. The EOF-based regressicn forecast scheme is not 
limited to input of coefficients derived from anal- 
yses. Coefficients derived from prognostic data 
fields, such asa 24-hcur forecast from a dynamic 
humerical prediction model, might improve the longer 
range forecasts. 

3. Each rOF coefficient refresents the contribution of 
the associated eigenvectcr to the total forcing. tThe 
resultant tropical cyclone movement iS a summation of 
the total forcing by | alipmodes: Additional insighe 
into the more important modes for tropical cyclone 
Forcing could possibly be obtained by examination of 
the correlaticn of the modes with the tropieae 
cyclone movement. 

4. Vertical coupling might be represented in the EOF 
modes for the three levels. Testing would involve 
the development and analysis of regression models 
using EOF coefficients from more than one pressure 


level in various combinations. 
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5. The zonal and meridional components of tropical 
cyclone movement are forecast separately by the 
current scheme, even though tropical cyclone movement 
1S a vector quantity. Ccrrelations exist between the 
zonal and meridional components of motion (Shapiro 
and Neumann, 1984). I@tprovements might arise fron 
Pie MicLOmeor ee DOLeM badly rmeadlCLOLrS which account for 
the correlation between the zonal and meridional 
components of motion. Also, ah operational nodel 
Might be improved using a grid rotated along the 
Gbectl one of Cyclone Smction {Shapiro and Neumann, 
1984). 

6. A vector EOF analySiS may improve the identification 
of forcing medes for tropical cyclone movement. 
Rotation of the eigenvectors could also be investi- 
gated for potential improvement of tne method. AS 
previously noted, more e€igenvectors may have to be 
retained to guard against underfactoring. 

The EOF-regressicn model definitely shows promise for 
improvement of operational forecasts or tropical cyclone 
movement. This Simple regression model performed very well 
on dependent data. Additional reductions in forecast error 
may be possible through inclusion of more sophisticated 
physical forcing parameters and prognostic fields. Further 


research appears warranted. 
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APPENDIX B 
TABLES 


FABLE LE 


Operational models fcr the prediction ; 
of tropical cyclone motion cver the North Atlantic. * 


TYPE Hepa s 


Sela oe Cras 


SIA Soltek 


DYNAMICAL 


DYNAMICAL 


DESCHAE Tien 


—_ = Eee ab =e ee =) ae OE =e ee See ee eee eee ee ee 


Ree eno es based on _ tracks of all 
Atlantic tropical cyclones since 
18862 (Operational 1968) 


Regression equation model utilizing 
predictors derived from Peres. 
and persistence. (Operational 197 { 


Regression eguation model utilizin 
predictors derived from climatology 
Pepe cones and observed geopoten- 
ial height data. 

(Operational 1906/7) 


Regression equation model utilizing 
pred oes derived from output of 
LIPER model and observed geopoten- 
tial height data. 

(Operational 1972) 


Regression equation model utilizing 
pee een dérived from output of 
LIPER model, observed and numeri- 
Cale, fOr geese r ot tae heignt 
data. (Operational 1973) 


Ba LO Gee ete te based on pressure- 
weighted wind field averaged 

orc ie props aes and represented 
on.a 154 km {at 22.5 a Spaced 

Cisne ce (Ove rataonal 1970) 

Movable Fine Mesh (MFM) baroclinic 
model a 10 levels in the ver- 


tical and 60 kn eee in the 
hori Zoreei. (Operational, 137 


1981) 
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TABLE Ea 


2: eigenvalues and 


ponent wind fields 
cumulative percentage of variance (in parentheses). 
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TABLE ALE 


eigenvalues and 


400 mb component wind fields: 1 
cumulative percentage of variance (if parentheses). 
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TABEESLY 


250 mb component wind fields: 
cumnulative percentage of variance (in parentheses). 
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Intercept and regression coefficients for the zonal 
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Intercept and regression coefficients for the meridional 
average speed equation using 250 mb EOFs. 
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